19 results found.
Language Type:
Multilingual
Languages:
Basque Catalan Galician Portuguese Spanish
Availability:
From Owner
License:
Part of the data will be distributed through LDC (not yet)
Size:
23.8 GByte
Production Status:
Newly created-finished
Use:
Language Identification
- Paper title: KALAKA-3: a database for the recognition of spoken European languages on YouTube audios
- Paper track: Speech
- Paper status: Accept Oral
| Author Number | Name | Affiliation | Country | Second Affiliation | Second Country |
|---|---|---|---|---|---|
| Author 1 | Luis Javier Rodriguez-Fuentes | University of the Basque Country UPV/EHU | ES | | |
| Author 2 | Mikel Penagarikano | University of the Basque Country | ES | | |
| Author 3 | Amparo Varona | University of the Basque Country | ES | | |
| Author 4 | Mireia Diez | University of the Basque Country | CZ | | |
| Author 5 | German Bordel | University of the Basque Country | None | University of the Basque Country | ES |
| Main Contact | Luis Javier Rodriguez-Fuentes | University of the Basque Country UPV/EHU | None | | |
Documentation:
Albayzin 2012 Language Recognition Evaluation Plan (http://iberspeech2012.ii.uam.es/images/PDFs/albayzin_lre12_evalplan_v1.3_springer.pdf)
Language Type:
Multilingual
Languages:
Basque Catalan English Galician Portuguese
Availability:
From Owner
License:
<Not Specified>
Size:
14 GByte
Production Status:
Newly created-finished
Use:
Language Identification
- Paper title: KALAKA-2: a TV Broadcast Speech Database for the Recognition of Iberian Languages in Clean and Noisy Environments
- Paper track: Speech
- Paper status: Accept Oral
| Author Number | Name | Affiliation | Country | Second Affiliation | Second Country |
|---|---|---|---|---|---|
| Author 1 | Luis Javier Rodríguez-Fuentes | University of the Basque Country | None | Euskal Herriko Unibertsitatea | None |
| Author 2 | Mikel Penagarikano | University of the Basque Country | None | Euskal Herriko Unibertsitatea | None |
| Author 3 | Amparo Varona | University of the Basque Country | None | | |
| Author 4 | Mireia Diez | University of the Basque Country | None | | |
| Author 5 | German Bordel | University of the Basque Country | None | University of the Basque Country | ES |
| Main Contact | Luis Javier Rodríguez-Fuentes | University of the Basque Country | ES | University of the Basque Country UPV/EHU | ES |
Documentation:
<Not Specified>
Written Corpus
Language Type:
Multilingual
Languages:
Basque Catalan Galician Portuguese Spanish
Availability:
Freely Available
License:
Creative Commons
Size:
65K sentences
Production Status:
Newly created-finished
Use:
Machine Translation, SpeechToSpeech Translation
- Paper title: TweetMT: A Parallel Microblog Corpus
- Paper track: Evaluation
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Iñaki San Vicente | Elhuyar Foundation / IXA - UPV-EHU | ot |
| Author 2 | Iñaki Alegria | University of the Basque Country (UPV/EHU) | ES |
| Author 3 | Cristina España-Bonet | Universitat Politècnica de Catalunya -- BarcelonaTech | ES |
| Author 4 | Pablo Gamallo | CITIUS, University of Santiago de Compostela | ES |
| Author 5 | Hugo Gonçalo Oliveira | CISUC, University of Coimbra | PT |
| Author 6 | Eva Martinez Garcia | TALP Research Center | ES |
| Author 7 | Antonio Toral | Dublin City University | IE |
| Author 8 | Arkaitz Zubiaga | University of Warwick | GB |
| Author 9 | Nora Aranberri | University of the Basque Country | ES |
| Main Contact | Iñaki San Vicente | Elhuyar Foundation / IXA - UPV-EHU | None |
Documentation:
<Not Specified>
Written Corpus
Language Type:
Multilingual
Languages:
Galician
Availability:
From Owner
License:
<Not Specified>
Size:
18000000 tokens
Production Status:
Existing-updated
Use:
Language Modelling
- Paper title: Developing New Linguistic Resources and Tools for the Galician Language
- Paper track: Written
- Paper status: Accept Poster+Demo
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Rodrigo Agerri | IXA NLP Group, University of the Basque Country (UPV/EHU) | ES |
| Author 2 | Xavier Gómez Guinovart | Universidade de Vigo | ES |
| Author 3 | German Rigau | IXA NLP Group, University of the Basque Country (UPV/EHU) | ES |
| Author 4 | Miguel Anxo Solla Portela | TALG Research Group, University of Vigo | ES |
| Main Contact | Rodrigo Agerri | IXA NLP Group, University of the Basque Country (UPV/EHU) | None |
Documentation:
<Not Specified>
Language Type:
Multilingual
Languages:
Galician
Availability:
From Owner
License:
<Not Specified>
Size:
600 hours
Production Status:
Existing-updated
Use:
Language Modelling
- Paper title: CORILGA: a Galician Multilevel Annotated Speech Corpus for Linguistic Analysis
- Paper track: Speech
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Carmen Garcia-Mateo | University of Vigo | ES |
| Author 2 | Antonio Cardenal | University of Vigo | ES |
| Author 3 | Xose Luis Regueira | University of Santiago de Compostela | ES |
| Author 4 | Elisa Fernández Rei | University of Santiago de Compostela | ES |
| Author 5 | Marta Martinez | University of Vigo | ES |
| Author 6 | Roberto Seara | University of Vigo | ES |
| Author 7 | Rocío Varela | University of Vigo | ES |
| Author 8 | Noemí Basanta | University of Santiago de Compostela | ES |
| Main Contact | Carmen Garcia-Mateo | University of Vigo | None |
Documentation:
User Manual in Galician
Written Corpus
Language Type:
Trilingual
Languages:
Galician Portuguese Spanish
Availability:
Freely Available
License:
GPL
Size:
1 MByte
Production Status:
Newly created-in progress
Use:
Person Identification
- Paper title: Multilingual corpora with coreferential annotation of person entities
- Paper track: Evaluation
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country | Second Affiliation | Second Country |
|---|---|---|---|---|---|
| Author 1 | Marcos Garcia | University of Santiago de Compostela | ES | Universidade da Corunha | ES |
| Author 2 | Pablo Gamallo | University of Santiago de Compostela | None | University of Santiago de Compostela | ES |
| Main Contact | Marcos Garcia | Universidade da Corunha | None | | |
Documentation:
In progress
Written Corpus
Language Type:
Multilingual
Languages:
Galician
Availability:
Freely Available
License:
Creative Commons Attribution 4.0 International Public License (CC BY 4.0)
Size:
<Not Specified>
Production Status:
Newly created-finished
Use:
Named Entity Recognition
- Paper title: Developing New Linguistic Resources and Tools for the Galician Language
- Paper track: Written
- Paper status: Accept Poster+Demo
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Rodrigo Agerri | IXA NLP Group, University of the Basque Country (UPV/EHU) | ES |
| Author 2 | Xavier Gómez Guinovart | Universidade de Vigo | ES |
| Author 3 | German Rigau | IXA NLP Group, University of the Basque Country (UPV/EHU) | ES |
| Author 4 | Miguel Anxo Solla Portela | TALG Research Group, University of Vigo | ES |
| Main Contact | Rodrigo Agerri | IXA NLP Group, University of the Basque Country (UPV/EHU) | None |
Documentation:
<Not Specified>
Written Lexicon
Language Type:
Multilingual
Languages:
Albanian Arabic Basque Bulgarian Catalan Chinese Croatian Danish Dutch English Finnish French Galician Greek Hebrew Icelandic Indonesian Italian Japanese Lithuanian Malay Norwegian Persian Polish Portuguese Romanian Slovak Slovene Spanish Swedish Thai
Availability:
Freely Available
License:
Multiple Licenses
Size:
1072646 synsets
Production Status:
Existing-used
Use:
All of the above
- Paper title: Some Issues with Building a Multilingual Wordnet
- Paper track: Infrastructural Issues/Large Projects/oral presentation
- Paper status: Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | John P. McCrae | Open Multilingual WordNet | /N |
Documentation:
None
Written Evaluation Package
Language Type:
Multilingual
Languages:
English Galician Portuguese Spanish
Availability:
Freely Available
License:
GPLv3
Size:
5.3 MByte
Production Status:
Newly created-in progress
Use:
Named Entity Recognition
- Paper title: Incorporating Lexico-semantic Heuristics into Coreference Resolution Sieves for Named Entity Recognition at Document-level
- Paper track: Evaluation
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country | Second Affiliation | Second Country |
|---|---|---|---|---|---|
| Author 1 | Marcos Garcia | University of Santiago de Compostela | ES | Universidade da Corunha | ES |
| Main Contact | Marcos Garcia | Universidade da Corunha | None | | |
Documentation:
README.txt




